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Abstract: A heuristic greedy algorithm is developed for efficiently tiling spatially dense redshift 
surveys. In its first application to the Galaxy and Mass Assembly (GAMA) redshift survey we find 
it rapidly improves the spatial uniformity of our data, and naturally corrects for any spatial bias 
introduced by the 2dF multi object spectrograph. We make conservative predictions for the final state 
of the GAMA redshift survey after our final allocation of time, and can be confident that even if worse 
than typical weather affects our observations, all of our main survey requirements will be met. 
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1 Introduction 

Large redshift surveys are typically completed by ob- 
serving with a Multi Object Spectrograph (MOS), ob- 
taining spectra for many hundreds of sources simul- 
taneously over large fields of view. The problem of 
how to optimise observing strategies to target sources 
distributed over some survey area with a given MOS, 
defining a field of view and number of simultaneous 
targets, falls into the "area packing" class of prob- 
lems. Much work outsi de of astronomy has been de - 
voted to such problems dMegiddot fc Supowits II1984T ) 
which are usually intractable in a formal, provably- 
optimal, sense. In the case of the Anglo-Australian 
Telescope's (AAT) largest survey to date , the 2 degree 
Field Galaxy Redshift Survey (2dFGRS. IColless et all 
2001), the survey was created in a manner that min- 
imised field overlaps in order to maximise area (the 



target magnitude limit being bj = 19.45). This obvi- 
ously had an impact on the target completeness, and 
the observations had to be weighted in order to ac- 
count for the local levels of incompleteness. At the 
other ex treme is the 6 d egree Field Galaxy Survey 
(6dFGS, Ijones et all 120041 1 that aimed for high levels 
of completeness within the local universe. In this case 
the filamentary structures (i.e. non-uniform overdensi- 
ties) present on small scales necessitate extremely non- 
uniform tile coverage and potentially large amounts of 
overlap among tiles, target densities varying from 6 to 
30 galaxies per deg 2 . Hence the optimal strategy for 
tiling is closely linked to the scientific objectives of the 
survey, and a generic approach will not be appropriate 
for all requirements. 

Fibre fed MOS instruments typically have a cir- 
cular field of view ( FOV), as seen for example in the 
2 degree Field f2dF. lLewis et al.ll2002T ). 6 degree Field 
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C6dF. |Jones et al.ll2004h. Sloan Digital Sky S urvey (SPSS) 
Spec trograph jYork et al.l200d). Hect ospec i|Fabricant et al 
l2005h and Hvdra l|Barden et al.lfl993T ). Also typical is 
for survey regions to be rectangular in spherical coordi- 
nate geometry: recent examples include the 2dFGRS 
Sloan Digital Sky Survey fSDSS. lAbazaiian et al.1200 1 
and Millennium Galaxy Catalogue fMGC. iLiske et*lil 
2003; [Driver et ail 12003:). This latter commonality is 
due to a number of allying factors: imaging CCDs used 
for input catalogues are almost always rectangulaiQ 
and survey boundaries and volumes are easier to con- 
sider when using spherical coordinate derived edges. 
Packing a shape best described in spherical coordinates 
into a Cartesian defined region is a non trivial task, and 
many approaches have been used in redshift surveys. 
Such packing problems are of wider mathematical in- 
terest because no provably o ptimal and rapid tech- 
nique has yet been discovered fMegiddot fc Supowits I 
1984). Instead every large survey tailors a tiling method 
in line with specific survey goals using a heuristic method 
In this sense a heuristic method is one informed by 
knowledge of the problem at hand, the hope being the 
solution is not much worse than optimal. On top of 
the generic problem of efficient tile packing, spectro- 
scopic surveys also have to contend with extremely 
non-uniform and complex selection functions within 
the tiles themselves. The major cause for the non- 
uniformity is object exclusion, either due to fibre col- 
lisions or slit overlaps. 

In the case of 2dFGRS an approach close to hexag- 
onal packing was used, where slight perturbations were 
made to a purely hexagonal grid of tile centres in or- 
der to better sample object densities. Since this survey 
was almost single pass (there was ~ 30% tile overlap), 
low completeness fields were not uncommon, an effect 
that was statistically adjusted for with observational 
weights. However, in the densest fields some targets 
will not have redshifts, and galaxy group assignments 
will not be as secure as in highly complete regions. 
The downside of such a regular approach is that all 
multi-fibre spectrographs will have structure or bias in 
their assignments and thus power will be added to (or 
removed from) certain frequencies in tangential modes 
of the power spectrum. The distribution of fibres is not 
only driven by the algorithm used to place them, but 
also the physical limitations of the instrument. Typi- 
cally a fibre fed MOS is designed with fibres around the 
circumference in such a way that all fibres can reach 
the centre and few can reach locations at the edge, a 
scenario that makes radially-dependent targeting dis- 
tributions inevitable. Even with the newest simulated 
annealing (SANN) algorithms available for AAOmega, 
radial ass ignment dependencies within each 2dF point- 
ing exist (jMiszalski ct al. 2006). It is obviously impor- 
tant to try to compensate for such biases in any work 
that is concerned with clustering and st ructure, such 
as G alaxy And Mass Assembly (GAMA, iDriver et"ai1 
2009), the latest large survey to use AAOmega on the 
A AT. 

The spectroscopic element of SDSS (Bla nton et al.l 

l2003h used a heuristic algorithm that attempted to 



The use of GALEX in WiggleZ ( Glazcbroo k et al.l 
2007) is a rare counter-example. 



find an acceptable solution of a perturbed uniform grid 
]of tiles, much like 2dFGRS. The algorithm aimed to 
utilise 90% of the 600 available fibres on each tile, and 
similar to 2dFGRS the SDSS's median tile coverage for 
an object was 1 (both achieved a target density ~100 
galaxies per square degree). Minimum fibre spacings 
are 55" for the SDSS spectrograph, larger than the 40" 
distance for 2dF, thus an obvious limitation of SDSS is 
the full targeting and unbiased analysis of close pairs 
(a key science objective for GAMA, discussed in detail 
below). 

Of recent surveys, the VIMOS VLT Deep Survey 
(VVD S from here) utilise d the simplest approach to 
tiling l|Bottini et all 120051 ). Effectively it placed tile 
centres on a fixed square grid with diagonal offsets 
used for the deeper component of the survey. Such 
an approach is possible when using VIMOS because of 
its mask-based grism spectrograph, giving it a square 
FOV better suited to tiling a square CCD photometric 
survey. The VVDS does not suffer from any radial se- 
lection bias, but due to the constraints imposed by slits 
cut into each mask it does possess complex selection 
effects such as the tendency to target a uniform spread 
of targets; highly clustered regions are hard to target 
since the slits necessarily avoid each other. Further 
complicating matters is a partially radial completeness 
bias, eviden t in the spectrosco pic masks created for 
zCOSMOS (|Knobel et alj|2009h . Whilst an interest- 
ing survey to note, such a survey design is not trivial 
to create with any of the fibre based multi object spec- 
trographs discussed due to their circular FOV, and the 
complex radial bias this introduces. 

Simulated annealing solutions of the tiling prob- 
lem have been utilised in large area surveys with large 
amounts of structure present , most notably by the 
6dFGS i|Campberl et all 120041 '). Simulated annealing 
is a popular approach for many algorithmically insolv- 
able problems and is, strictly speaking, a metaheuris- 
tic solution (i.e. choices have to be made about the 
element to be optimised and also the method of opti- 
mising). In simple terms the user must pick some- 
thing to be minimised (or maximised), such as the 
total number of objects not assigned to a fibre after 
tiling the whole survey region. The user must also 
give the SANN algorithm variables to perturb (most 
obviously the right ascension and declination of the tile 
centres), and a rate at which it 'cools' towards a so- 
lution. Typically these perturbations become smaller 
as the solution improves, and eventually an acceptable 
set of tile positions should be found. Packing prob- 
lems lend themselves well to SANN since they can be 
tuned to find acceptable solutions rapidly, but they are 
non-deterministic algorithms (unlike the other heuris- 
tic approaches discussed) and are neither provably op- 
timal nor stable (i.e. small variations to the problem to 
be optimised can produce radically different results). 
In the case of the 6dFGS, SANN is obviously much 
more effective than any sort of regular tiling because 
the projected target densities vary significantly and 
the survey area is large. The use of SANN reduces the 
number of sparsely populated fields and better samples 
overdensities where fields would be full. 

Added to the complexities of these different ap- 
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proaches are the observational limitations for any sur- 
vey as well as its scientific priorities. It will not be 
the case that all fields are equally observable in a large 
area survey (e.g. varying rising and setting time as a 
function of RA), but in a sufficiently small area survey 
it will often be the case that all parts of the survey 
field are effectively as observable as each other. Also, 
the end point of the survey will often be an unknown 
(i.e. weather dependent), so in many applications it is 
advantageous for the survey to be in a useable state 
as quickly as possible. With these extra considera- 
tions in mind, the philosophy that was applied to tiling 
GAMA was one where each tile would in some sense be 
the next most optimal tile, and every subsequent tile 
should make a significant impact towards achieving the 
GAMA survey requirements. 

The GAMA redshift survey is one component of 
the multi-band GAMA survey project, and is the lat- 
est large survey to use the AAT's MOS facility. In this 
paper we explore the problems of tiling specifically for 
the GAMA survey, with the possibility of using the 
approaches discussed in future redshift surveys with 
characteristics in common with GAMA. In section 2 
we outline the GAMA survey, and how the scientific 
goals for the project translate into survey requirements 
that our tiling algorithm must achieve. In section 3 
we discuss in detail the different options to tiling that 
are appropriate for GAMA. In section 4 we apply the 
two most likely candidates for the tiling algorithm to 
the GAMA survey as it was left at the end of year 1, 
allowing quantitative judgements of the different ap- 
proaches to be made. In section 5 we apply our chosen 
tiling algorithm to the data and present the state of 
the survey after year 2. Finally, conservative predic- 
tions are made for the state of the survey after year 3 
observations based on tiling simulations. 



2 The GAMA Survey 

The GAMA project is a multi-band imaging and spec- 
troscopic survey containing just under 144 square de- 
grees of sky in three nearly identical 12° x 4° areas 
centred on 9 h +1°; 12 h +0° and 14 h 30 m +0° (known 
as GAMA 09 or G09, GAMA 12 or G12 and GAMA 
15 or G15). Future expansion to include two South- 
ern 8° x 6° regions, to meet the survey requirements 
for m easuring the halo mass function dDriver et al.l 
2009), is part of the design consideration. One of 
these southern regions may also be the focus of Aus- 
tralian Sqjiaxe_J^iIometre_Array Pathfinder observa- 
tions (| Johnston et alj 120081 ) in the proposed DINGO 
programme. Eventually all regions will be fully cov- 
ered in FUV, NUV, u, g, r, i, z, Y, J, H, K and 
far-IR, and will utilise imaging data from the SDSS, 
UKIRT Infrared Deep Sky Survey (UKIDSS), VLT 
Survey Telescope (VST), Visible and Infrared Survey 
Telescope for Astronomy (VISTA), GALEX and the 
Herschel Space Observatory. This imaging dataset is 
being complemented by a three-year redshift survey 
using the AAOmega spectrograph at the Anglo Aus- 
tralian Telescope (AAT). Observations allocated dur- 
ing 2008 (year 1) and 2009 (year 2) have been com- 



pleted, with a third allocation of observing time during 
2010 (year 3) remaining. The 2008 observations were 
made using a different approach to tiling (as discussed 
in detail below), and the tiling algorithms discussed 
here continue from the state it was left in then. 

The GAMA survey is the latest in a long line of 
large galaxy surveys using the AAT to obtain redshifts 
(e.g. 2dFGRS and MGC) , and is primarily designed to 
measure the halo mass function (HMF) , with other sci- 
entific goals including an investigation of close pairs of 
galaxies (i.e. merging systems) and a fully dust cor- 
rected description of the galaxy luminosity function 
(LF) from the far-UV to the far-IR, along with the 
associated galaxy stellar mass function. The GAMA 
redshift survey aims to be exceptionally complete over 
the three large areas of sky described above. This re- 
quires careful planning in order to maximise the sci- 
entific output of the AAO mega instrument used to 
measure galaxy spectra (see lSharp et al.ll2006l . and the 
AAOmega websit^3 f° r details) . 

In the case of the spectroscopic component of the 
GAMA survey, the requirement is for extremely high 
levels of completeness for all objects that are within 
our sample selection. This requires repeated observa- 
tions for all areas of the survey, and thus tile place- 
ments become increasingly non-regular as the survey 
progresses in order to successfully target residual over- 
densities that appear. 

The tiling algorithm used for GAMA must achieve 
a number of scientific goals (which have been trans- 
lated into survey requirements) assuming conservative 
assumptions regarding observing time lost to weather. 
GAMA has strict primary targets, chosen so as to max- 
imise our scientific return, and secondary goals to aim 
for upon completion of these. 

2.1 Survey Requirements 

Listed below are the primary survey requirements, which 
should be achieved by the end of the third year of ob- 
servations at the AAT (assuming typical time loss due 
to bad weather and equipment failure). All references 
to completeness refer to the fraction of targets assigned 
at least one fibre compared to the number of objects 
in the input catalogue of targets. This does not mean 
all of these objects will eventually have redshifts (typi- 
cally only 90-99% of targets return a redshift), or that 
all of the targets are galaxies (e.g. our star/galaxy sep- 
aration is not perfect, see Baldry et al. 2009 submitted 
for details). 

• Flux limit: Fibre assignments for 99% of tar- 
gets with r ve tro < 19.4 in G09 and G15 and 
rpetro < 19.8 in G12. Also Kab < 17.5 and 
Zmodei < 18.2 in all three GAMA regions. For 
later reference, objects that satisfy at least one 
of these magnitude limits are main survey tar- 
gets. 

GAMA aims to be 99% complete (or better) in terms 
of targeting for these three survey bands in each region. 

2 www.aao.gov.au/ A AO / 2df/ aaomega/ 
aaomega_manuals.html 
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The r-band limits account for 114,780 of the 119,859 
galaxies that meet these combined flux limits (95.7%). 
Of the remainder 4,079 are provided by the K-band 
limit, with only 1,000 galaxies introduced to our sam- 
ple by the addition of the z-band limit. The r-band 
limit was defined by our scientific goals for GAM A, 
and is a compromise between depth (deeper surveys 
have more objects per square degree), time available 
given the area GAMA is covering (only so many galax- 
ies can be targeted) and the probable S/N we can ex- 
pect with AAOmega (redshift success rates drop off 
as a function of magnitude). The Tf-band limit was 
adopted to improve the quality of GSMFs obtained 
with GAMA, and was the deepest possible that kept 
the total number of required redshifts within achiev- 
able bounds. Finally the z-band limit was introduced 
because it is the reddest band available in SDSS and 
should ensure completeness in r and K for low surface 
brightness galaxies. For further details on the exact 
target selection u sed for GAMA refer to iBaldrv et al.l 
(12009. submitted! ). 

• Spatial completeness: 99% of each region to be 
at least 80% targeting complete on the angular 
scale of 0.14°. 

In order to improve the halo mass function to signifi- 
cantly lower masses than previously probed it is impor- 
tant that we have both high overall completeness (as 
defined above), as well as high levels of completeness 
on small spatial scales. Since the structures of interest 
are groups and clusters, the comoving physical scale of 
interest is ~ IMpc, and at z = 0.1 (typical for high 
confidence systems) this subtends ~ 0.14° (projected 
comoving distance when Ho=71 kms -1 Mpc -1 assum- 
ing Q m ~ 0.3 and Q,a=0-7)- For reliable estimates of 
velocity dispersions, and indeed for structures to be 
identified in the first place, a large fraction of poten- 
tial members must have redshifts. In the case of very 
low mass groups (the type that we are most interested 
in) we require at least two redshifts to attempt a veloc- 
ity dispersion (in the strictest sense this is true for the 
same reason we can measure the standard deviation 
of 2 data points, but more data are required to mea- 
sure the velocity dispersion confidently). 80% com- 
pleteness means our expectation for a 3 object system 
is 2 or more redshifts, and 4 redshifts in a 5 object 
system. The desire that this level of spatial complete- 
ness is achieved in 99% of each GAMA region is one 
of practicality, 100% is obviously desirable, but 99% is 
acceptable (i.e. we would not miss too many groups). 

• Pair completeness: Fibre assignments for 99% of 
galaxies within 40" of another galaxy. 

Another scientific goal for GAMA is to thoroughly ex- 
plore the merger rate of galaxies out to z — 0.5. Since 
merging systems will necessarily be close on the sky, 
this obviously requires high levels of redshift complete- 
ness for galaxies with small angular separations. The 
value of 40" was chosen since this is the separation 
at which fibre collisions on 2dF become a significant 
issue. Measuring closely clustered objects on scales 
smaller than this limit is potentially difficult and must 
be approached as part of the primary survey observing 
strategy. 



2.2 Extended Survey Goals 

• Flux limit uniformity: Every 0.1 magnitude bin 
99% redshift complete for the magnitude limits 
given above. 

Since redshift completeness is a function of flux (it is 
harder to obtain reliable redshifts for fainter objects) 
care should be taken so that our sample is not pref- 
erentially biased to brighter galaxies. This is a much 
harder target than achieving 99% overall completeness, 
and since the effect can be corrected for later this is 
only considered to be a secondary survey goal. Should 
observing progress successfully, and assuming the re- 
quirements discussed above have been met, this could 
be an important survey goal in the latter stages. 

• All galaxies should be observed with —2 h < HA 
< 2 (where HA is the hour angle). 

Whilst it is desirable that every galaxy is observed at 
zenith for the entirety of the integration period, this 
is obviously not possible. A sensible constraint for 
GAMA is that all objects should be observed within 
2 h of the meridian in order to keep the air-mass down, 
but in exceptional conditions this constraint may have 
to be omitted for reasons of practicality. It is gener- 
ally true to say that when one of G09, G12 or G15 is 
observable all galaxy positions are equally acceptable 
within a region, the exceptions being at the extreme 
of our allowed HA range. 

• Reobservation of all targets for which we failed 
to obtain a redshift. 

A large fraction of redshift failures will be caused by 
effects unrelated to the true viability of a target. For 
instance partial cloud cover during observation or fibre 
positioning errors both conspire to reduce the amount 
of flux entering a target fibre, and since the chance of 
obtaining a redshift is proportional to the S/N this 
will mean fainter objects are more likely to be classed 
as a failed target. So as not to introduce any unwanted 
targeting bias to the GAMA survey, we ideally should 
observe all failed targets at least twice. As well as 
giving the object a chance to be observed in a more 
favourable plate position and better weather condi- 
tions, we can use the summed integration time even 
if S/N is low in the reobservation. Thus our redshift 
survey should be minimally biased by flux. 

2.3 GAMA Survey to Date 

Beyond achieving the requirements and goals stated 
above, a complicating factor for the tiling algorithm 
to be used is that it must continue the GAMA sur- 
vey from how it was left at the end of the first year of 
observations. Due to tight time constraints, the year 
1 tiling of GAMA was implemented using a simplistic 
gridding system where each region was divided into 
three rows and eight columns, with the divisions being 
lines of longitude and latitude in spherical coordinates. 
Each vertical box-edge was adjusted in right ascension 
(RA) until all boxes in a row contained a similar num- 
ber of targets, then objects were extracted into 2 sep- 
arate catalogs containing half the targets each. The 
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aim of the first year was to try to observe each box 
twice, a feat that was nearly achieved due to three ex- 
tremely successful weeks of observations at the A AT. 
Whilst this returned a fantastic number of redshifts 
(~ 51,000) it became apparent that the distribution 
of objects with redshifts betrayed clear signs of their 
gridded origin; an effect of the configuration routine 
for the 2dF. This routine, known as CONFIGURE, is 
supplied to observers at the AAT in order to convert 
lists of desired targets into valid fibre locations on the 
2dF, and is the closest interfac e observers have to th e 
eventual distribution of fibres (Miszalski et al. 2006). 




0.0 0.2 0.4 0.6 0.8 1.0 



Radius/ 

Fi gure 1 '. The radial bias of the simulated annealing algo- 
rithm used in the AAT 2dF CONFIGURE software. 300 sim- 
ulations of a random uniform 2dF region were configured 
with the tiling software, where 600 targets were randomly 
assigned a priority level between 1 and 9 (higher num- 
ber indicates higher priority), and 378 fibres were working. 
All densities are weighted by area, thus no radial gradient 
would be a uniform distribution in this plot. All combined 
priority levels are plotted (black line) as well as all priority 
levels from 2-9 (blue-red). A rectangular density kernel 
was used with a bandwidth of 0.02. Horizontal dotted line 
denotes the uniform distribution. Vertical dotted lines de- 
note regions beyond which edge effects render the densities 
meaningless because the bandwidth samples outside of the 
physical limits of the 2dF. 



Whilst the newest versions of CONFIGURE (a GAMA- 
specific version 7.10+ was used throughout) offers vast 
improvements over older routines, and produces much 
less pronounced spatial features, it still possesses a 
clear radial gradient. Evidence of this gradient can be 
found in Figure [T] This plot shows the probability of 
targets obtaining a fibre as a function of distance from 
the 2dF centre. As well as demonstrating the general 
tendency for a random set of targets to have a cen- 
tral bias (the black line in the plot), different CONFIG- 
URE priority levels were investigated separately, where 
a higher number (maximum of 9, minimum of 1) in- 



dicates the simulated annealing algorithm tries harder 
to put a fibre on a target. Radial effects are not ev- 
ident, or are very small, for high priority levels, but 
it is clear from Figure [1] that the radial distortion be- 
comes extremely noticeable for low priority objects for 
the simulations conducted here. 

The effect of the algorithm is to return a more 
uniform distribution for the highest priority targets, at 
the expense of lower priorities. The result of this fibre 
assignment gradient is that given a region that has an 
even distribution of targets within the FOV, objects in 
the centre, especially those assigned a low priority, are 
more likely to be allocated a fibre than similar priority 
targets near the edge. This is an almost unavoidable 
effect since many more fibres are able to reach central 
targets. At the extreme, an object exactly in the centre 
of a field is reachable by all 392 fibres (400 minus the 
8 guide fibre bundles), but one at the extreme edge 
of the field (directly in front of a fibre) might only by 
reachable by 1. 

In the example presented here, all priority 5 targets 
and higher could have been assigned a fibre in theory. 
This means that purely by virtue of assigning fibres 
to a large fraction of these targets a close to uniform 
distribution is assured, and hence the gradient is much 
more evident for priority level of 4 and below. As a 
guide to the gradient expected if all targets possess the 
same priority, the combined distribution is the most 
indicative (black line). Thus assigning all targets to a 
high priority will not eliminate the gradient, but the 
most undesirable features will always affect the lowest 
priority targets more. 

The impact of such a radial selection function on 
data which is gridded in a Cartesian manner should be 
clear: corners are under-sampled compared to all other 
regions. This effect was exacerbated in the first year 
GAMA data because gridded subsets were observed 
twice. Figure[2]is a plot of local completeness, showing 
the fraction of main survey targets observed inside a 
circular top-hat of radius 0.14° (the local completeness 
scale stated in the survey requirements). The central 
light strip in G12 is due to a deeper survey limit for this 
region, r pet ro < 19.8 here compared to r pe t ro < 19.0 or 
Tpetro < 19.4 for all other targets in year 1 (the use of 
these limits is discussed in detail below). Ignoring this 
strip, the next obvious feature is periodicity in com- 
pleteness, demonstrating the clear Cartesian residual 
embedded in the data after the year 1 strategy. This 
coherent regular structure is due to radial effects in 
the CONFIGURE software. The most obvious features 
are long, highly complete regions that are at the same 
declination in GAMA 09 and GAMA 15 (the central 
strip in GAMA 12 is by design, as discussed above). 

Running orthogonally to these strips in right as- 
cension are periodic strips in declination. Due to the 
target boxes being shuffled in right ascension, these 
strips do not necessarily span the full range of decli- 
nation, but they are particularly obvious at the top of 
G09 and G12, and the bottom of G15. The extremely 
blue (incomplete) regions are those not visited during 
GAMA year 1 — the reason these regions are not per- 
fectly blue is because various older surveys (e.g. 2dF- 
GRS and SDSS) already provide redshifts for a small 
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Figure 2: The state of the GAMA regions after the first 
year of data. The plots describe survey redshift complete- 
ness inside a circular top-hat kernel with a diameter of 
0.14° This was chosen since it is the angular extent of 
a 1 Mpc system at z ~ 0.1 assuming a ACDM cosmol- 
ogy and Hq=71 kms — 1 Mpc — , and thus represents the 
group/cluster scale. Blue through to red represents 0%— 
80% completeness, whilst black through to white represents 
80%-100% completeness. One of the main survey goals is 
that 99% of the pixels in each GAMA survey area are 80% 
complete, i.e. this plot is 99% grey-scale. 



fraction of GAMA targets here. 

As well as needing to consider issues regarding the 
removal of non-cosmic structure from our completeness 
map, the year 1 GAMA survey was conducted with dif- 
ferent magnitude limits to those now required. These 
were used in order to increase the scientific return from 
the first year of spectroscopic data, and should not neg- 
atively impact the survey from this point. The major 
difference from the GAMA survey requirement mag- 
nitude limits stated above is that only an r-band pet- 
rosian magnitude was used, and the limit was r pe t ro < 
19.0 in G09 and G15, and a mixture of r pe tro < 19.0 
and r pe tro < 19.4 in G12 (due to the excellent weather 
G12 was extended in overall depth midway through 
year 1), with the addition of the deeper strip in G12 
limited to r pe tro < 19.8 (this strip is obvious in Figure 
[2j. Since there are two more years of observations to 
be made this selection effect should not be difficult to 
compensate for in the long term, and part of the rea- 
son our first extended survey goal is to achieve equal 
redshift completeness as a function of magnitude. 



3 Tiling Options Explored 

In algorithmic terms the approach desired for tiling 
GAMA from its p ost year 1 state is a t ype of heuristic 
greedy algorithm ijCormen et al]ll99Ch . where the tile 
about to be put down maximises some property of the 
survey, and in the longer term the task of tiling is 
not made too much harder by this greediness. Such 
an approach is both desirable and possible due to the 
extremely high object density required for the GAMA 
survey. This means the problem is contrary to the 
type applicable to low spatial density redshift surveys 
(e.g. the 6dF survey) because on average the number 
of 2dF tiles placed on a given area will be extremely 
high (conservative estimates suggest every position will 
be contained within at least six separate tiles), rather 
than deliberately low (i.e. minimally packed). 

On a slightly separate issue, because the GAMA 
survey is particularly interested in low mass halos it is 
absolutely essential that highly clustered objects are 
attacked in an aggressive manner. There are con- 
straints on this process however since the 2dF has 
physical limitations on how close together fibres can be 
placed. This problem can be solved by repeatedly ob- 
serving clustered regions, and making sure the "worst 
offending" objects in clustered regions are observed as 
early as possible in order to achieve the tiling require- 
ments. 

The issue of tiling is interwoven with the prob- 
lem of assigning fibres to targets on the 2dF. The pro- 
gram used for assigning fibres on the 2dF instrument 
(configure) has been continually upgraded since its 
introduction 10 years ago, and now the algorithm of 
choice is based on simulated annealing of the fibre al- 
locations. Whilst this approach offers massive advan- 
tages over the older Oxford and Taylor algorithms (see 
iMiszalski et alj2006l for details), it is non-deterministic. 
Every time the configuration is attempted a different 
solution will almost certainly be found (a feature to be 
added to CONFIGURE is the option of setting the ran- 
dom seed, but a small perturbation in the input file 
will still create a radically different solution). Since a 
typical configuration time is of the order 10 minutes it 
is computationally challenging to incorporate the fibre 
assignments into a long term optimisation approach 
to tiling, be this SANN or quasi-Newton BFGS (Broy- 
den, Fletcher, Goldfarb and Shanno) optimisation of 
the tile positions (for a disc ussion of multidimensiona l 
optimisation algorithms see lNocedal fc Wright 1 12006). 
For sparse surveys such as 6dF, where there is little 
tile overlap for the most part, this will not present 
such a problem since a given object is typically only 
in one tile, but for GAMA it rapidly impacts on the 
efficiency of the tiling. The other issue with optimising 
for all of the tile positions in a survey such as GAMA 
is that it offers no insight into where it would be best 
to place the next tile since this would mean optimis- 
ing for N t u e \, where Nui e is the number of tiles (i.e. 
every possible tile ordering). For 50 tiles this would 
mean ~ 10 64 full survey configurations, and this is 
assuming the tile positions are already optimal. This 
makes the problem highly intractable computationally, 
instead the standard approach (e.g. 6dF) is to make all 
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potential tiles an equally good option. 

When the survey area is extremely large (i.e. only 
a small fraction of it is observable at a given moment) 
producing a large number of equally good target fields 
makes a lot of sense since it is hard to predict ex- 
actly which region will be within the required zenith 
distance range when observations start, the typical ad- 
vised limit on a given field being ±2 h (hence this be- 
ing a survey goal). In the case of GAM A, a 2dF tile 
can be placed in any part of the survey sub-regions, 
so we are free to place the next tile in the most opti- 
mal position. Since the longest a GAMA region can 
be observed for whilst remaining inside the hour-angle 
limits is 4 h 48 m (the extra 48 m comes from the RA 
length of each GAMA region) , the next GAMA region 
will always be at a smaller (more desirable) hour-angle 
before we are limited by RA within the current region. 
The exceptions to this are the first and last fields of 
the night, where it might be necessary to limit our ob- 
servations to the survey region extremes in order to 
maximise observation time. 

Bearing in mind these competing factors the final 
matter that must be decided is what aspect of the 
survey should be improved with each tile used. The 
two most obvious possibilities, based on the survey 
requirements discussed in the previous section, were 
the number of redshifts obtained (hereafter referred to 
as greedy), and the spatial completeness of the survey 
(hereafter referred to as dengreedy). The former case 
would simply involve determining which region of the 
survey has the greatest number of high priority targets 
within a two degree FOV, regardless of any other infor- 
mation. This would be the crudest type of greedy algo- 
rithm, in the mathematical sense, because all each tile 
cares about is where the densest collection of targets 
is. The reason this could become too crude is because 
even at the mid stage of the survey there will be mul- 
tiple places in the survey region that contain far more 
targets within a 2dF tile than there are fibres, and 
whilst each of these tile locations would improve the 
total completeness of the survey by the same degree, 
they will not necessarily improve the spatial complete- 
ness by the same amount. The greedy algorithm might 
accidentally pick the location that improves the spa- 
tial completeness the most, but only a small fraction 
of the time. Hence always placing the tile centres at 
the densest point might be too greedy given our survey 
requirements. 

The dengreedy approach of improving the spatial 
completeness is slightly more subtle. It works by choos- 
ing tile centres based on which location in the survey 
(when sampled with the 2dF) is the least spatially 
complete, regardless of how many targets are avail- 
able. Whilst sounding potentially disastrous, allowing 
the tile centres freedom regardless of the number of tar- 
gets works very effectively. Given the 2 degree FOV of 
the 2dF, the large scale structure of the universe in- 
troduces relatively small variations in the homogene- 
ity of our target galaxies. By design, spatial optimi- 
sation achieves angular completeness faster than the 
purely greedy approach, but it does typically return 
fewer redshifts after a given number of tiles. Since 
the main scientific goal of GAMA is to measure the 



halo mass function for very low mass systems, which 
requires high spatial completeness, this is not necessar- 
ily a terrible compromise. It should be noted that den- 
greedy still generally favours regions missing the most 
redshifts (given the local variability of the large scale 
structure), but since the algorithm works specifically 
to level the spatial completeness it will often find quite 
different tile position solutions given the same survey 
state. 
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Figure 3: Comparison of different tiling approaches. Sim- 
ulated annealing (SANN), greedy and dengreedy approaches 
to tiling are simulated on identical data. 22,000 objects are 
randomly distributed inside a 12° X 4° area and 350 objects 
(at most) are removed each time a simulated observation 
is completed. The plot shows the cumulative difference in 
objects extracted from the maximum possible as a function 
of tile number. If all possible tiles are observed the opti- 
mal type of tiling is a variety of simulated annealing (Ideal 
SANN), but this is only more efficient when removed tar- 
gets are predictable and nearly every expected tile is used 
(Real SANN performs significantly worse). For list of pri- 
orities used in the GAMA survey, see Tabled 



A simplistic comparison of greedy, dengreedy and 
SANN (the most implementable type of full tile posi- 
tion optimisation when the number of free parameters 
is large since it is resistant to local minima) is made 
in Figure [3] This shows the cumulative difference be- 
tween the total number of targets acquired and the 
maximum possible as a function of tile number. 22,000 
objects were randomly generated in an area the same 
size as a GAMA region; this number chosen because it 
is roughly the number of objects left to target in G12. 
Since the tiling imprint dominates the target object 
structure rapidly, a uniform distribution of targets is 
adequate for comparative purposes. The actual CON- 
FIGURE program is not used (this would be too time 
consuming), instead 350 objects are randomly removed 
from a 2 degree FOV (without replacement) from the 
survey area for each tile, and the plot shows the cumu- 
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lative difference in objects targeted as a function of tile 
number. The ideal simulated annealing (Ideal SANN) 
removes objects from each tile in a consistent manner 
(to simulate the output of configure being predictable), 
and also has a specified number of tiles to use (65). 
greedy and dengreedy on the other hand attempt to 
improve the total survey completeness or spatial com- 
pleteness as much as possible with each subsequent 
tile. 

To reflect how the random distribution of targets 
produced by CONFIGURE can affect the efficiency of 
simulated annealing, a variation of this tiling was made 
(Real SANN) which uses the same tile positions as 
Ideal SANN, but randomly selects 350 objects. This 
removal of objects is also done without replacement, 
but it is non-deterministic and thus will not return 
the same object assignment solution as Ideal SANN. 
Clearly this has a significant impact on the efficiency of 
the tiling, and it means simulated annealing goes from 
being the most effective approach (when targeted ob- 
jects are deterministic and nearly every tile generated 
is used) to the worst. Interestingly, dengreedy achieves 
higher levels of completeness than greedy towards the 
end of these simulations. These are simple compar- 
isons, but they do highlight the issue of how simulated 
annealing will find good solutions only when the in- 
puts to the problem are precisely known. If a survey 
finishes a few tiles sooner than expected due to bad 
weather (a realistic prospect for many surveys) then 
the gains brought by SANN are lost, and equally if 
there is a non-deterministic black-box contained within 
the problem to be optimised (in this case configure) 
then the solution could be far from optimal. 

Since the number of tiles remaining for the GAMA 
spectroscopic survey is unknown, and the small sur- 
vey area lends itself well to observing the next best 
position at nearly all times, a decision was made at 
an early stage to concentrate efforts on investigating 
the greedy and dengreedy algorithms. This means ap- 
proaches that attempt to optimise for all tile positions 
(in this case SANN, but includes any type of multi- 
dimensional optimisation routines such as BFGS) will 
not be discussed further since they cannot truly opti- 
mise for a non-deterministic configuration routine and 
an unknown number of remaining tiles. The other 
weakness of total survey optimisation is that it can- 
not properly compensate for the subtle effects of fibre 
targeting gradients discussed in the previous section, 
whilst a tile-by-tile type of optimisation will contin- 
ually make small adjustments based on exactly these 
effects. 

3.1 Which Type of Greedy? 

To determine the optimal position of the next tile both 
the greedy and dengreedy approaches were investigated 
thoroughly. The greedy algorithm will simply choose 
the tile location that has the most main survey tar- 
gets within it, for instance in G09 this would be in the 
centre of the unobserved region in the top-right (see 
Figure [5J. The dengreedy algorithm, however, would 
not pick exactly the same location. Because it con- 
volves the targets with the full two degree FOV, the 
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Figure 4: The spatial completeness of the GAMA regions 
after the first year of data. The plots describe survey com- 
pleteness inside a circular top-hat kernel with a diameter 
of 2°. See Figure [2] for further details of the completeness 
metric. 



least complete point in the survey tends to be at the 
extreme edge of the survey region when there is a large 
incomplete area. This is clear in Figure [3] where the 
GAMA incompleteness maps have been convolved with 
the full 2 degree FOV. The most incomplete point in 
G09 is the extreme top-right corner when considered 
in this manner. 

It was realised that allowing the field centres to 
move to such extremes would, in the long term, be 
detrimental to the survey. The most serious concern is 
that too few objects might be selected to use ~ 100% 
of the available fibres, and even if there were plenty of 
targets in the field, the Cartesian geometry could re- 
duce the fraction of targets successfully assigned. The 
obvious solution is to put mild limits on how close 
to the survey edge the tile centres are allowed to be, 
effectively limiting the part of the survey that can pro- 
vide a minimum in the completeness map. Simulations 
were conducted on G09 to ascertain the ideal distance 
to use, the results suggesting that any buffer between 
0.3°-0.5° improves the tiling efficiency (survey require- 
ments met faster), and 0.4° appeared to be about opti- 
mal (survey requirements obtained 2 tiles faster than 
without a buffer). These buffer zones should not be 
enforced when the number of targets remaining is very 
small (hundreds within a GAMA region) because the 
extreme region edges will often be the best place to 
place a tile. 

Such a positional limitation is not necessary for the 
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• algorithm because it will rarely be the case that 
more targets will be contained within a FOV at a re- 
gion edge than slightly inset. Generally a greedy tile 
centre will be nearly 1° from a survey edge to maximise 
the number of targets within. These subtle effects can 
be seen in the plots of Figure [5] which show the po- 
sitions of tile centres using both the greedy and the 
dengreedy approaches for the tiling metric using sur- 
vey buffers (the tile centres inside the buffer zone are 
due to the caveats discussed above). The greedy algo- 
rithm generally positions tiles much further inside the 
survey limits, the average distance of each tile from the 
centre of GAMA 09 is 3.54° for greedy and 3.66° for 
dengreedy. The consequence is that there is more over- 
lap between tiles using greedy, and that it takes longer 
for every part of GAMA 09 to have been contained 
within a two degree FOV once. Both plots show the 
positions of the tiles that bring the survey complete- 
ness up to 99%, which in these simulations happens 
to occur after 48 tiles for both greedy and dengreedy 
(run to run, the exact number of tiles will differ due 
to the random nature of the simulated annealing used 
in configure). 

The major advantage of using dengreedy over greedy 
is in the latter stages of the survey when approach- 
ing high total completeness (remembering our require- 
ment is 99%). As a qualitative example, whilst the 
greedy algorithm is naturally biased towards large clus- 
ters that are missing, say, 20% of potential members, 
dengreedy will be drawn towards less dense regions con- 
taining numerically poor groups missing, say, 25% of 
potential members. Whilst the large cluster may be 
missing more objects in total, its dynamics will al- 
ready be reliably measurable at 80% completeness. 
The more tenuous small groups require very high lev- 
els of completeness to confidently apply grouping algo- 
rithms (e.g. Friends-of- Friends) , and in order to con- 
struct the halo mass function down to exciting new 
levels it is these systems that are the key. As should 
be expected, dengreedy achieves our spatial complete- 
ness targets (99% of the survey area is locally at least 
80% complete) faster than greedy (46 tiles, compared 
to 48, in these simulations). Figure|S]demonstrates the 
the long term superiority of the dengreedy algorithm 
clearly. When we are close to the end of the survey 
(within 15 tiles) dengreedy returns consistently better 
total and spatial completeness. This means should our 
survey be extremely hindered by bad weather or tech- 
nical problems, the data set will be much more com- 
plete. Based on this reasoning, the tiling algorithm 
that we selected for continuing the GAMA spectro- 
scopic survey was dengreedy. 



4 Tiling Algorithm Implemen- 
tation 

Having chosen dengreedy as our tiling method, we must 
now consider a number of issues that can significantly 
impact the efficiency of our survey regardless of the 
tiling algorithm to be used. 
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Figure 5: Plots demonstrating the differing distribution 
of 2dF tiles when GAMA 09 has achieved 99% complete- 
ness, using both the greedy (top) and dengreedy (bottom) 
approaches for the tiling metric. The dotted line, in both, 
plots indicates a 0.4° tile centre buffer. It is clear that the 
greedy algorithm typically positions tiles a large distance 
from the survey edge, whilst dengreedy often places tiles 
right up to the survey buffer limit. Both approaches con- 
centrate tiles on the least complete regions of GAMA 09 (as 
seen in Figure [2}, hence the large number in the top-right 
region of GAMA 09. dengreedy produces better packing, 
which translates to less overlap between tiles. 



4.1 Priority Bumping 

Since one of our survey requirements is high complete- 
ness for close pair targets, an issue that had to be ad- 
dressed was fibre collisions hindering the rate at which 
clusters can be maximally sampled. Typically two fi- 
bre buttons can be no closer together than 40" (the 
actual exclusion geometry is more complex, but this 
is deemed an appropriate estimate on the AAOmega 
website), which at z = 0.2 (approximately the me- 
dian redshift of GAMA) corresponds to 131 kpc. A 
compact group might have numerous galaxies closer 
together than this distance, even ignoring projections 
that render any system more closely packed when ob- 
served. Added to this, one of the primary scientific 
goals of GAMA is an analysis of merging galaxies and 
close pairs, so placing fibres on a large fraction of such 
pairs is vital. The only way to overcome the problem 
of fibre collisions is by re-observation of the same re- 
gion of sky, a certainty in the GAMA survey. Thus in 
order to observe clustered regions as efficiently as pos- 
sible, an aggressive approach to close pair targeting 
was used. 
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Fi gure 6; Plot comparing greedy and dengreedy. The y- 
axis shows the relative total and spatial completeness: the 
greedy completeness divided by the dengreedy completeness. 
When this ratio is greater than 1 the greedy algorithm is do- 
ing a superior job, and the reverse is true when the ratio is 
below 1. The data is plotted up to tile 48 (when both algo- 
rithms achieve the required spatial completeness). Whilst 
much of the early tiling favours the greedy algorithm, den- 
greedy is clearly doing a better job of improving spatial and 
total completeness when we are within 15 tiles of the sur- 
vey's end. 



For each tile generated a collision matrix of all the 
main survey targets is created. From this the worst 
offending target (i.e. target that is within 40" of the 
most other targets) is found, and its priority level is 
increased by 1. This makes it much more likely the 
CONFIGURE program will place a fibre on it in the tile 
being created. Furthermore, in order to improve the 
chances these colliding targets are successfully assigned 
a fibre, all the objects that they are interfering with 
are removed from the list of potential targets for the 
tile being made. This last step is important since all 
the highest priority targets would otherwise be in re- 
gions that are difficult to configure, and the simulated 
annealing algorithm will often cool to a solution be- 
fore a large fraction of these targets are assigned a 
fibre. With the worst offending target bumped up one 
level of priority and the interfering targets removed, 
the next worst offending collider is found and the pro- 
cess repeated until no objects closer than 40" remain 
in the sample of interest. 

By following this process for every tile made, usu- 
ally 100% of the highly colliding targets are removed 
each time, and consequently as the survey approaches 
high levels of completeness we are not left with pock- 
ets of targets that require multiple configurations. The 
effectiveness of this aggressive approach to targeting 
clusters is clear from simulations conducted for the 
GAMA 09 region: using priority bumping means 99% 



spatial completeness is achieved with 46 tiles (from 
the survey state at the end of year 1 using dengreedy), 
however if no priority bumping is used this same level 
of completeness typically requires 2-3 more tiles. Ob- 
viously the local spatial completeness considers angu- 
lar regions much larger than the 40" collisions being 
targeted by the priority bumping, but the long term 
rewards of the approach seem clear. 
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Fi gure 7; The fraction of fibre assignments to potential 
targets for different priority levels. 300 Monte-Carlo simu- 
lations were made, where 600 objects were uniformly dis- 
tributed in spherical coordinates within a 2 degree FOV 
and assigned a priority level between 1 and 9. There were 
378 fibres available for each configuration. The dotted line 
indicates that priorities 5 and above always within the high- 
est priority 378 objects, so in theory these higher priorities 
could all be complete. The error bars indicate the 15.9% 
and 84.1% quartiles for the assignment fractions from the 
300 simulations, so reflect la errors. 



4.2 Priority Levels 

When constructing input files for CONFIGURE (files with 
the .fid extension) care must also be taken with how 
priorities are assigned to targets. Figure [T] demon- 
strated how the highest priority targets are also those 
with the least radial bias, whilst Figure [7] shows that 
even when there are plenty of fibres available, higher 
priority targets will obtain better completeness. The 
main GAMA survey was awarded priority levels of 6, 
7 and 8. Priority 6 objects are main survey objects 
that have been observed once but a redshift was not 
obtained. Since weather conditions and the location 
of the target on the 2dF drive redshift success rates 
(fibre placement errors occur as a function of tile posi- 
tion, hence S/N and redshift success), it is prudent to 
observe such failures more than once, and these come 
back into the target list at a lower priority than the 
unobserved objects. Priority 7 objects are main survey 



www.publish.csiro.au/journals/pasa 



11 



targets that have not been observed and that are not 
highly clustered, or priority 6 objects that are highly 
clustered and have had their priority bumped. Priority 
8 is reserved for highly clustered priority 7 targets that 
have had their priority level bumped up. Priority 9 is 
reserved for spectral standards (only 3 per field) and 
emergency additions — although this back-up function- 
ality was not required. To guarantee that every fibre 
is used (and to make headway on any deeper redshift 
survey in the same region), filler targets were created 
and these targets were assigned lower priorities. The 
full list of priority levels and object types for year 2 
onwards can be found in Table [TJ 

The priorities assigned to targets were different be- 
tween the two years. In year 1, the targets consisted 
only of the r-band selection with r ps f — r mo d e i > 0.25 
(there was insufficient UKIDSS coverage at the time), 
without an already known redshift. The priorities were 
from high-to-low: (i) r < 19.0; (ii) 19.0 < r < 19.8 in 
G12 within ±0.5° of the celestial equator (creating the 
central strip clear in Figure [2}; (iii) 19.0 < r < 19.4 in 
G09 and G15, and remaining 19.0 < r < 19.8 in G12. 
In addition, clustered targets in any of these categories 
were given a higher priority. 



Priority 


Object Type 


9 


Spectral Standards 


8 


Clustered P7 


7 


Main Survey/ Clustered P6 


6 


Failed Main Survey 


2-5 


Filler Targets 



Table 1: Priority table. Main survey targets are within 
the main GAMA regions and have r < 19.4 for G09/G15, 
r < 19.8 for G12, or K AB < 17.5 or z model < 18.2 for any 
region. The priority 2-5 filler targets use r < 19.8 for G09 
and G15, and has any one of g model < 20.6, r model < 19.8 
or imode.1 < 19.4. Selected fillers also cover an extended 
survey area using the main survey magnitude limits, the 
GAMA regions beoming 14° x4.5° strips. Also used as filler 
objects are objects that either have poor quality AAOmega 
spectra, or are missing it altogether because the redshift 
comes from an older survey. 

To create a configuration input file 600 targets are 
drawn from the input catalogue. This number was cho- 
sen since it allows enough overhead for every fibre to 
be used, whilst remaining small enough to keep config- 
uration times down. To achieve this number of targets 
in the .Ad file all the priority 8 targets within the FOV 
are extracted, if there are more than 600 then a ran- 
dom sample of 600 is taken, if there are less than 600 
then all of them are put into the .fid file. Assuming, 
for example, there are 150 priority 8 objects, then 450 
spaces remain in the file. Next all the available priority 
7 objects are extracted, again if there are fewer than 
450 all of them are used, or else a random sample of 
450 is taken and used. This process is repeated down 
to the priority level that can fill all the remaining slots, 
or until all targets within the FOV have been used. In 
practice the former condition is always reached first. 

After extensive testing it was decided that prior- 
ity 6,7 and 8 targets would be used to determine the 



locations of tiles, where all three priority levels carry 
the same weighting when calculating the completeness 
within a 2 degree FOV for dengreedy. This means ob- 
jects that have spectra, but were not of high enough 
S/N to obtain a reliable redshift, are allowed to influ- 
ence the positions of the tiles. This seems reasonable 
when considering that the redshift success rate within 
a field can reach 100% when the seeing and weather are 
ideal, but drop considerably when conditions worsen, 
so in order to not introduce a temporal bias these red- 
shift failures should be re-observed and be allowed to 
drive the tiling metric. 

4.3 Field Positioning 

When 5 or more fibres are not assigned, despite there 
being 600 potential targets, the central coordinates of 
the tile are moved to a more favourable position (we 
find this situation occurs for ~ 25% of tiles). The most 
successful approach is to take the median right ascen- 
sion and declination of all targets, and use this as the 
new tile centre. This overcomes the effects of unusual 
geometries (even with the region buffer, corners can be 
a problem), without allowing outliers to unduly influ- 
ence the tile centre. If a shift in tile centre is required, 
then the survey buffer is no longer used (hence the 
small fraction of field centres inside the buffer region 
in Figure [SJ). 

The final adjustment to the tiling algorithm is an 
option to force a tile to lie within a certain RA range 
of the GAMA region about to be observed. This This 
may be necessary at the start or end of a night when 
the only observable GAMA region is still at high air- 
mass. When actually observing this meant the first 
field of the night had to be within the low RA 16 m of 
GAMA 09, and the last field of the night had to be 
within the high RA 16 m of GAMA 15. 

4.4 Survey Selection Function 

To some degree the greedy algorithms discussed, and 
the dengreedy algorithm used, will allow a selection 
function for the survey to be calculated; the algorithm 
is both simple and reproducible. However, due to con- 
tinuous feedback from failed observations (typically 
due to bad weather) the survey will never be trivial to 
reproduce from start to finish. In the case of GAMA, 
the algorithm was implemented from a partially com- 
pleted state, further complicating the calculation of a 
full selection function. 

Ultimately, varying instrument efficiency (especially 
over multiple years), seeing, throughput as a function 
of plate position and weather will all conspire to make 
the true selection unknown, and any retrospective cal- 
culation an approximation. This is even assuming 
CONFIGURE behaves in a perfectly predictable way, but 
the variations in fibre assignments (especially with the 
addition of object feedback) will produce highly diver- 
gent tile allocations in the latter stages of the survey. 
As an example, when running simulations discussed 
above multiple runs will produce identical tile centres 
for the first 20 tiles, but small deviations in coordinate 
positions begin to appear beyond this point. By the 



12 



Publications of the Astronomical Society of Australia 



last few tiles of the survey the distribution of targets 
can differ entirely. This is indicative of the complex, 
and unavoidable, interplay between fibre distributions 
on plates and plate distributions on the sky, and clearly 
a perfect selection function is limited by the precise be- 
haviour of CONFIGURE. 

GAMA aims to overcome the worst aspects of an 
uncertain selection function by achieving unprecedented 
levels of completeness, as defined in multiple ways. If 
100% (or near to it) target completeness is achieved 
then all our survey statistics will be heavily dominated 
by cosmic (or sample) variance rather than our selec- 
tion function. 

5 GAMA Survey Progress and 
Predictions 

In year 2 107 fields were observed (from a possible 154) , 
which is slightly better than the median return at the 
AAT for that time of year, and from these 31,836 good 
quality (Q > 3) redshifts were obtained. This is a lot 
less than in year 1, but largely due to unavoidable fac- 
tors (weather effects and instrument downtime). Also, 
fainter magnitude limits were used for year 2 targets 
(r < 19.4 in GAMA 09 and GAMA 15 for year 2 com- 
pared to r < 19.0 for year 1), which obviously affects 
the average S/N and lowers the redshift success rate. 

Due to a mixture of observational constraints and 
a keenness to progress one field to the point where 
halo mass function science is possible, GAMA 09 had 
40 of these fields, GAMA 12 had 42 whilst GAMA 15 
only had 23. The spatial completeness maps for each 
GAMA region after the completion of the second year 
GAMA observations are shown in Figure [5] 

It is clear from these plots that GAMA 09 is the 
nearest to achieving the spatial completeness target for 
GAMA. In fact GAMA 09 is just over 95% complete 
for the main survey after year 2, and over 93% spatially 
complete (using the earlier definition of what fraction 
of the region achieves 80% local completeness). G12 is 
83% complete for the main survey and 66% spatially 
complete. G15 is 82% complete for the main survey 
and 65% spatially complete. 

From the current state of the GAMA survey for 
all 3 regions, it is possible to make quite accurate pre- 
dictions about how the survey will appear after the 
third and final year of observations assuming partic- 
ular weather losses. The expectation at the AAT is 
that there is a 2/3 probability of a given field being 
successfully observed. Due to the observational con- 
straints of the survey we expect ~ 154 fields to be 
observed (this was the field limit for the GAMA year 
2 time allocation, due to fitting observations around 
dark-time the number of year 3 fields will differ). As- 
suming a binomial distribution for the probability of 
fields being observed, the median number of successful 
fields we expect in year 3 is 103 (slightly less than the 
number obtained in year 2). We define "weather mi- 
nus 1 sigma" to be the number of tiles for which the 
integrated binomial distribution is equal to the inte- 
grated normal distribution from -co to — la (0.159): 
this equates to 97 tiles. Based on similar logic we can 
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Figure 8: The state of the GAMA regions after the sec- 
ond year of data. See Figure [2] for further details of the 
completeness metric. 



calculate that year 2 had +0.5<r weather, and year 1 
and year 2 combined had better than +5a weather 
(mostly due to the near perfect weather during year 1). 
Using these numbers for available tiles, we can make 
reasonable, conservative, predictions for the final state 
of the first 3 years of the GAMA survey. 

To achieve the hardest survey requirement of 99% 
completeness in each GAMA region will take a total of 
71 more fields (in practice a few more will be required 
when bad weather failures are fed back in). This is 
well inside even the weather — la limit, and requires 
12 more tiles for G09, 32 more for G12 and 27 more for 
G15. Figure[5]shows what the local completeness maps 
for each region will look like at the exact point all our 
main survey targets are achieved. Encouragingly even 
GAMA 12 looks uniformly complete, despite the deep 
strip created during year 1 making the distribution of 
tiles (and fibres) much less random in declination. 

Figure [TO] shows the completeness maps for each 
GAMA region assuming weather —la (i.e. 97 more 
tiles), thus is a fairly conservative estimate of the fi- 
nal state of the survey. This would provide 20 tiles 
for G09, 42 tiles for G12 and 35 tiles for G15 if simi- 
lar completeness is desired in all three GAMA regions. 
It is not unrealistic for the hardest survey goal to be 
reached either — the requirement that every 0.1 mag- 
nitude Petrosian r-band bin is 99% complete. This 
is predicted to take 110 more tiles, and would require 
a slightly better run than experienced in year 2 (107 
tiles). In terms used previously, this would require 
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Figure 9: The predicted state of the GAMA regions when 
all the major survey targets have been achieved. This re- 
quires 12 more tiles for G09, 32 more tiles for G12 and 27 
more tiles for G15. See Figure [2] for further details of the 
completeness metric. 
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Figure 10: The predicted state of the GAMA regions 
assuming we experience weather — la for year 3. This pro- 
vides 20 more tiles for G09, 42 more tiles for G12 and 35 
more tiles for G15 (i.e. 97 tiles are predicted, and these are 
distributed according to available targets). See Figure[2]for 
further details of the completeness metric. 



weather +1.3<r. 

Since bad weather is often highly correlated, it is 
optimistic to treat the chance of consecutive fields be- 
ing observed as independent events (as required for a 
true binomial distribution). However, even in the ex- 
treme event of half the fields being lost due to bad 
weather we will still achieve our primary survey re- 
quirements, and even one week of perfect weather should 
be enough to bring all three GAMA regions over the 
95% completeness mark. 

Figure[TT]shows how the completeness of each GAMA 
region varies as a function of r pe t ro , where fainter tar- 
gets are less complete. Achieving a uniform level of 
99% completeness over all magnitudes is clearly a harder 
task than achieving 99% global completeness, and since 
redshift success depends on apparent magnitude (brighter 
objects have a higher success rate) we should also ex- 
pect the failure rate to increase from that experienced 
so far. This will mean extra fields will be required 
to obtain safe redshifts, and these numbers should be 
considered lower limits for survey predictions. 

6 Summary 

This work has demonstrated that a greedy approach 
to tiling proves to be extremely successful in densely 
packed surveys such as GAMA. By aggressively tar- 
geting under-densities with each field used, high levels 



of spatial completeness should be a reality for each 
GAMA region by the end of the third year of obser- 
vations. In the meantime we allow for a simple mech- 
anism to feed redshift failures back in, and by priori- 
tising highly clustered regions we obtain both a large 
number of close pairs, and guarantee we are not left 
with difficult pockets of galaxies in the final stages of 
the survey. Further still, by utilising every non-main 
survey fibre on deeper targets, we ensure efficient use 
of the 2dF instrument, and make a head-start on any 
future extended redshift surveys in the GAMA regions. 
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